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Abstract 


^  u  i  1  "  ' 

We  investigate  the  problem  of  selecting  the  ‘best'  one  of  k  arbitrary 
systems  or  alternatives.  Consider  one  observation  from  each  of  the  k 
systems.  By  'best,'  Me  mean  that  system  Mhich  has  the  highest 
probability  of  yielding  the  'most  desirable'  of  the  k  observations. 
The  term  'most  desirable'  is  defined  according  to  some  criterion  of 
goodness  determined  by  the  experimenter »  We  show  that  this  problem 
can  be  formulated  as  a  multinomial  selection  problem.  Hence, 
multinomial  selection  procedures  are,  in  a  sense,  nonparametric 
procedures.  An  up-to-date  survey  of  such  multinomial  procedures  is 
given.  Further,  xe  describe  how  some  of  these  procedures  can  be 
adapted  for  use  in  the  simulation  environment. 


1.  Introduct i on 


Consider  k  different  competing  populations  (or  systems  or 
alternatives,  etc).  A  natural  question  to  ask  1st  Which  of  these  k 
systems  is  'best?'  By  'best'  system,  we  could  informally  mean,  e.g. 
-that  one  of  k  inventory  policies  which  maximizes  profit, 

-that  one  of  k  scales  which  is  the  most  precise,  or 
-that  one  of  k  computer  systems  which  has  the  greatest  availability. 
Thus,  'best'  can  take  on  a  variety  of  meanings  depending  on  the 
practical  problem  at  hand. 

Denote  the  k  populations  (sources  of  observations)  as 
,1^, . . . ,Vfe,  respectively.  Suppose  we  take  independent  vectoi — 
observations  . .  ,Xk)  ,  where  is  from  Ij,  i*l,...,k.  Further , 

for  i«l,...,k,  denote 

Pt  -  PtX4  is  the  'most  desirable'  of  XltX2,. . . ,Xk>. 

The  term  'most  desirable'  must  be  defined  according  to  some  criterion 
of  goodness  determined  by  the  experimenter.  Assume  that  nothing  is 
known  beforehand  concerning  the  values  of  the  p.’s.  Obviously,  that 
associated  with  the  largest  of  the  Pj '*  is  the  population  which  has 
the  highest  probability  of  yielding  the  ' most  desirable '  observation 
(of  those  observations  from  the  k- vector)  .  In  this  paper,  our  goal 
will  be  to  find  that  associated  with  the  largest  of  the  p^'s.  We 
refer  to  that  K.  as  the  'best'  population. 

In  order  to  motivate  this  definition,  consider  a  simple  example. 
Let  A  and  B  be  two  (s,S)  inventory  policies.  Profit  is  taken  to  be 
the  criterion  of  desirability.  Suppose  that 

Profit  from  A  *  1000  with  probability  0.001 
■  O  "  0.999 


Profit  from  B  •  0.999  with  probability  1. 

Clearly,  E (Profit  from  A)  *  1  >  0.999  -  E(Profit  fr om  B>|  1.*.,  A 
gives,  the  higher  mvermge  profit.  However,  PfProfit  from  B  >  Pro4it 
4 r om  A)  *  0.999;  there4ore,  B  gives  the  higher  pro4it  almost  mil  of 
the  time .  For  this  reason,  the  experimenter  might  Justifiably 
consider  policy  B  to  be  better  then  policy  A. 

Hence,  it  is  meaning4ul  to  consider  as  ’best'  the  policy  which 
will  most  likely  produce  the  'most  desirable*  observation. 

This  goal  o4  finding  the  'best*  population  can  be  viewed  as  that 
of  finding  that  cell  of  a  k-nomial  distribution  with  the  largest 
underlying  probability.  Suppose  that  we  take  one  observation  from 
each  of  the  k  populations.  Award  a  one  (a  'success')  to  the 
corresponding  to  the  'most  desirable*  of  these  k  observations  (use 
randomization  if  necessary.)  Award  a  zero  to  the  remaining  k-1  Vi's. 
Clearly,  this  is  the  same  as  taking  an  observation  from  a  multinomial 
distribution  with  cell  probabilities  p^,...,P|(. 

Thus,  the  problem  of  finding  the  'best'  one  of  k  arbitrary 
populations  can  be  formulated  as  a  problem  of  finding  that  one 
category  of  a  k-nomial  distribution  with  the  highest  underlying 
'success'  probability.  This  implies  that  any  procedure  which  finds 
the  multinomial  cell  associated  with  the  largest  probability  is  a 
nonpar m»e trie  procedure.  Since  most  real-life  systems  do  not  follow 
one  of  the  'usual  *  probability  distributions,  such  nonparametric 
procedures  are  seen  to  be  very  useful.  We  group  these  nonparametric 
procedures  under  the  heading  of  multinomial  selection  procedures . 
Additional  motivation  for  the  above  arguments  can  be  found  in 
Bechhofer  and  Sobel  (1938). 

In  Section  2  of  this  paper,  we  give  a  brief  summary  of  the 


pertinent  notation  and  terminology.  In  Section  3,  aome  o<  the 
existing  eel  action  procedurea  are  preaented.  Section  4  ia  concerned 
with  applications  to  eimulation. 


Mr  now  introduce  notation  end  terminology  which  will  be  useful 
4 or  investigating  the  problem  of  finding  the  multinomial  cell  which 
has  the  largest  cell  probability.  Suppose  that  we  take  independent 
observations  sequentially  from  a  k-nomial  distribution  with  cell 
probabilities  PjtPj** *  * ,pk*  pi  —  £ps  *  1»  until  some  stopping 
criterion  (several  of  which  will  be  given  in  the  sequel)  is  met.  Most 
of  the  procedures  which  we  will  study  take  observations  (up  to  a 
limit,  perhaps)  until  one  cell  has  ‘significantly  more*  successes  than 
the  other  cells.  In  this  case,  the  stopping  criteria  call  for  the 
termination  of  the  procedures. 

Denote  k  .as  the  number  of  observations  from  cell  i  after  t 
multinomial  observations  (or  ‘stages*)  have  been  taken,  i*l,...,kt 
t«l  ,2, . . .  Further,  denote  Ptlj  <  Pt2J  1  -  p[|cl  *s  the  ord®ret* 

Pj's  and  t  <_  ...  <_  ^  as  the  ordered  t**’  AssuRI('  *hat  we 

have  no  a  priori  knowledge  as  to  how  the  are  paired  with  the 

multinomial  cells. 

Our  goal  is  to  select  as  best  that  cell  which  is  associated  with 
^Ckl*  the  largest  probability.  If  the  cell  corresponding  to  ptfc3  la 
actually  chosen,  we  say  that  a  correct  selection  (CS)  has  been  made. 
Also,  it  is  desired  that  the  probability  of  correct  selection  (PCCS>) 

f  f  f  f 

be  at  least  P  whenever  ©  Ptk-ij  1  P[jcj»  where  <P  ,©  >  is  pre¬ 
specified  by  the  user  (with  1  <  ©*  <  •  and  1/k  <  P*  <  1).  Define 

«  te.i©*pCk_n  <  PCk3>-  We  cal1  the  Pref*rence  zon *  *nd 
the  indifference-zone .  [Multinomial  procedures  such  as  those  to  be 
considered  below  fall  under  the  classification  of  so-called 
indifference-zone  selection  procedures .  Another  rich  family  of 
selection  procedures  employs  the  so-called  subset  approach ;  this 


s- 


approach  Mil]  not  b>  emphasized  here.  The  reader  should  refer  to 
Gupta  and  Panchapakrsan  (1979)  for  material  concerning  the 
indifference-zone  and  subset  methodologies. 3 

We  Mill  consider  the  following  configuration  of  p[J3's  •*  a 
benchmark  for  comparison  among  procedures: 

p£k3  «  e*Pm.  i*i,...,k-i  tso 


I.e. ,  p[i3  ■=  (k-lW1,  i«l,...,k-l;  p[kJ  -  ©*  (k-l+e*)~l.  SC  stands 
for  slippage  configuration  (with  slippage  factor  ©*).  For  some 
sampling  procedures  (cf:  Bechhofer,  Elmaghraby,  and  Horse  (1959)), 
this  configuration  of  minimizes  the  PCCS3  over  In  this 

case,  the  SC  is  called  the  least-favorable  configuration  (LFC> - 
Informally,  the  LFC  can  be  viewed  ms  a  'worst  case'  configuration 
(given  that  It  is  not  known  whether  the  SC  is  the  LFC  for  all 

of  the  multinomial  procedures  to  be  presented  in  the  sequel.  However, 
this  is  a  reasonable  conjecture;  we  shall  treat  the  SC  as  if  it  is 
the  LFC.  Since  we  desire  PCCS3  >  P*  for  all  configurations  E.c&q*« 
then  (assuming  the  conjecture  to  be  true)  we  can  equivalently  require 
that  PCCS{^*SC)>P*.  Hence,  it  is  meaningful  to  investigate  the  SC. 

Another  interesting  configuration  is  the  equal  probability 
configuration  (EPC) ,  where  p,  *  1/k  for  all  i.  Of  course,  the  term 
'correct  selection*  is  now  meaningless;  but  the  EPC  is  useful  as 
another  benchmark  in  that  we  would  expect  such  a  configuration  to 
maximize  a  multinomial  procedure's  expected  sample  size  (i.e.,  the 


expected  number  of  multinomial  observations  needed  before  the 
termination  criterion  is  met).  Denote  the  sample  size  for  a  procedure 


P  as  Sp.  E(Sp)  is  the  expected  sample  size. 


Ideally,  Mr  Ml  ah  to  find  a  procedure  Mhlch  guarantee* 
P{CSI^»SC)>P  but  Mhlch  is  also  parsiftionlous  Mlth  observations;  that 
is,  E(6plj>«SC)  and  E<Spl|»«EPC>  should  be  ‘low.* 


3.  Some  Multinomial  Procedures 

In  thi»  section,  we  concentrate  on  indiff erence-zone  procedures 
for  selecting  the  multinomial  cell  which  has  the  largest  probability. 
Recall  that  when  using  the  indifference-zone  approach,  the 
experimenter  must  pre-specify  two  constants,  P*  and  ©*.  The 
procedures  to  be  discussed  below  insure  that 

PfCSS©*pCk  i;j  <  PCk3)  >  P*,  (PR) 

where  PR  stands  for  probability  requirement .  For  all  of  these 
procedures,  we  establish  the  following  conventions: 

-All  observations  are  independent  multinomial  observations. 

**T  is  defined  to  be  the  stage  at  which  the  procedure  in  question 
terminates  sampling.  T  may  be  a  random  variable. 

-We  will  choose  as  best  that  cell  corresponding  to  K[kl  T  (using 
randomization  if  necessary). 

3. 1  A  single-sample  procedure 

The  first  procedure  we  consider  is  that  of  ©echhofer,  Elmaghraby, 
and  Morse  (1959),  denoted  as  Pg^. 

Procedure  P--,*: 

btn 

1.  Specify  k,  P*,  and  ©*. 

2.  Take  observations,  where  «  NBEM(k,P*,©*>  is 

to  be  found  in  the  tables  of  BEM  (1959).  N___  is  the  number  of 

Dtn 

multinomial  observations  which  must  be  taken  in  order  to  satisfy 


-B- 


Remarks  3. It 

1.  Kesten  and  Horse  <1959>  prove  that  the  SC  is  the  LFC. 

2.  In  the  number  of  observations  we  take  is  fixed  at 

^BEM’  ^or  reason,  the  procedure  is  said  to  be  a 

fixed-sample  or  single-sample  procedure. 

S' 

Example  3. 1 . 1 t 

Suppose  that  k«3  and  that  we  specify  P*-0.75  and  ©#«3. 

Use  the  following  table  (abstracted  from  BEH  (1959>>  in  order 
to  find  NBEm. 


nbem 

H  1 
H  ! 

! 

, ! 

1.5 

2.0 

3.0 

1 

.355 

.429 

.500 

.600 

2 

.355 

.429 

.500 

.600 

3 

.362 

.464 

.563 

.696 

4 

.367 

.484 

.594 

.734 

5 

.370 

.496 

.617 

.769 

6 

.374 

.515 

.646 

.804 

Table  1  (for 

PBEM>  * 

• 

PtCSSk*3,f>=LFC>  for  selected  ©*  and 


Reading  down  the  ©**3.0  column,  we  see  that  >(^*5  is  the 
smallest  value  of  which  achieves  the  PR  (Note  that  owing 

to  the  discrete  nature  of  the  multinomial  distribution, 
overshoots  slightly  the  desired  P*«0.75.>  Hence,  if  we  take 


5  observations,  the  PR  Mill  be  gutrantwd.^ 


3.2  P 


BK’ 


an  Improved  version  of  P 


BEH 


P 

BEM 


By  considering  the  following  example,  it  becomes  apparent  that 
is  sometimes  wasteful  with  observations. 


Example  3.2. It 

Suppose  that  k«2,  *nd  Xg* (x ^  5,x2  5>“<^*1>- 

Obviously,  it  is  impossible  to  terminate  sampling  with  x}  T<x^  1 
In  other  words,  there  is  no  chance  for  cell  2  to  be  chosen. 
Since  cell  1  is  guaranteed  to  be  the  victor  regardless  of 
the  remaining  two  observations,  we  should  stop  sampling  at  1*5. 


With  this  example  in  mind,  we  compare  two  procedures,  the  latter  due 
to  Bechhofer  and  Kulkarni  (1983). 


Procedure 

1.  Specify  k  and  N. 

2.  Take  N  observations.^ 

Procedure 

1.  Specify  k  and  N. 

2.  Take  observations  until  either 

A.  The  stage  t«N  or 

B.  t-xCk-lD  (stop  *»nipling  if  the  cell  (s)  with 

the  second  largest  number  of  observations  can  only  tie  the 


cell  corresponding  to  ^ , 


even  if  the  remaining  N-t 


observation*  ere  taken.) 


// 


Remark*  3.2c 


1.  Note  that  PfiK  is  a  sequential  procedure. 

2.  It  is  clear  that  E(6p  )<E(S_  >. 

rBK  “  rBEM 

3.  Bechhofer  and  Kulkarni  show  that  P (CS I  «P{CS  I . 

Thus,  P»BK  preserves  the  P<CS>  of  the  less  parsimonious 
procedure,  />  Hence,  we  can  use  the  more  efficient  P 

with  no  loss  of  PtCS). 


BK 


Example  3.2.2: 

Let  k*3,  P**0.75,  and  6**3.  Then  E(Sp  >*N_  *5.  It  is 

”bem 

straightforward  (but  tedious)  to  show  that  E(Sp  )*3.95 

rBK 


in  the  LJFC. 


// 


3. 3  A  sequential  procedure  due  to  Ramey  and  Alam  (1979) 

Procedure 

1.  Specify  k,  P*,  6*. 

2.  Take  observations  until  either 

*•  “ckl.t""  or 

B.  xr._  .-xr.  . *r ,  where  r  and  N  are  determined  by  k, 

P*,  and  6*,  and  are  to  be  found  in  tables  for  certain  k,  P*, 


and  6  (NB:  See  Remarks  be Jon.) 


// 


Remarks  3.3: 

1.  Ramey  and  Alam's  tables  actually  contain  a  number  of  error*; 


the  user  1  *  advised  to  consult  Bechhofer  end  Goldsman  ( 1984s) 

2.  The  number  of  observations  which  lakes  is  boundtd  by 
kN-k+1. 

3.  It  is  not  known  whether  the  BC  is  the  LFC  for  all  k  for  P 

nl 

but  we  will  make  the  reasonable  assumption  that  this  is  the 
case. 


4.  r  and  N  are  determined  in  such  a  way  that  the  PR  is 
satisfied  and  E(S.  lg“LFC>  is  minimized  over  the  (r,N) 


grid. 


5.  PRft  is  not  directly  comparable  to  PgK.  However,  for  most 

*  a 

choices  of  k,  P  ,  and  G  ,  it  seems  that  PRft  requires  fewer 
observations  (on  the  average)  than  P_K» 


Example  3.3.1: 

*  # 

Again,  let  k*3,  P  *0.75,  and  G  *3.  We  abstract  a  small 
portion  of  the  necessary  (corrected)  tables  for  POA  from 


Bechhofer  and  Goldsaan  (1984a) 


12- 


ss 

& 


P<CS> 


.796 

.760 

.756 

.757 


E  (S> 


3.68 

4.70 

8.80 

18.24 


Table  2  <4or  PRft>  t 

P<CStk«=3,£«LFC>,  E (St . )  fo r  various  P*,  8* 


We  see  that  if  r*2  and  N=3  are  chosen,  a  PCCSl  of  0.796  a*i X 1 

be  achieved  in  the  conjectured  LFC.  The  overshoot  of  the 

PfCS!^=LFC>  <0.796  vs.  P*-0.75>  is  again  due  to  the  discrete 

nature  of  the  problem.  Further,  in  this  exaapie, 

E(S_  :e*UFC>«3.68  <  3.95*E(Sp  ! £*LFC >  .  .. 

RA  bk 


3.4  An  unbounded  sequential  procedure, 


Bechhofer,  Kiefer,  and  Sobel  (1968)  give  an  unbounded  (or 


open )  sequential  procedure  which  satisfies  the  PR. 


Procedure  P^: 

1.  Specify  k,  P*,  ©*. 

2.  Take  observations  until 

f  <l/©*>  Ck3»t  ti3»t  <  (i-p*>/p*. 
!■! 


Remark  3.4t 

1.  BKS  show  that  the  SC  is  the  LFC  for  this  procedure. 


SBS5 s 
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Example  3.4.11 

Let  k>3,  P*»0.75,  ©**3.  Consul ting  the  appropriate  tables  in 

Bechhofer  and  Goldsman  (1984b),  Me  Immediately  find  that 

PCCSIp*LFC>  *  0.842  (.0004)  and  E(SP  l£«LFC>  . 

rBKS 

4.526  (.051).  These  results  are  Monte  Carlo  estimates 
obtained  via  simulation;  the  entries  in  parentheses  are  the 
accompanying  standard  errors.  The  results  are  nearly  exact, 
as  can  be  seen  by  the  small  standard  errors.^ 


IV  -■ 


3.5  *n  improved  version  of  P 


BKS 


As  in  the  above  example,  it  turns  out  that  P^g  frequently  yields 


P(CS!j>*LFC>»P".  This  extra  P(CS>  is  at  the  cost  of  unnecessary 
observations.  Therefore,  Bechhofer  and  Goldsman  (1984b)  give  a 


procedure  Mhich  decreases  the  attained  P(CS>  to  a  level  slightly 


greater  than  P  ,  but  which  also  saves  observations. 


Procedure  PgE* 


1.  Specify  k,  P*,  ©*. 


2.  Take  observations  until  either 


k— 1  _  x  -x  r  .  _ 

A.  J  (1/6  )  Ck3»t  CO,t 

i»l 


<  (1-P  )/P  or 


B.  the  stage  t-Ngg,  where  Ngg  is  determined  by  k,  P  , 

©*,  and  is  to  be  found  in  Bechhofer  and  Goldsman *s  tabli 


for  certain  values  of  k,  P*,  ©*• 


// 


Remarks  3.5t 


1.  Hgg  I •  chosen  the  small est  upper  bound  on  the  total 
number  of  observations  such  that  the  PR  is  satisfied. 

2.  Unlike  Pgg  is  bounded. 

3.  It  is  not  known  whether  the  SC  is  the  LFC  for  this 
procedure,  but  we  so  conjecture. 

4.  PBB  is  neither  directly  comparable  to  PgK  nor  PRft.  For 

many  choices  of  k,  P#,  ©*,  it  seems  that  PRB  requires  fewer 
observations  (on  the  average)  than  The  authors  feel 

that  the  user  should  consult  the  relevant  tables  when 
designing  an  experiment. 

Example  3.S.1: 

Let  k«=3,  P*«=0.75t  0**3.  We  now  abstract  a  small  portion 
of  the  necessary  tables  for  from  Bechhofer  and  Goldsman 

( 1984b) . 


p* 

e* 

nbg 

PCCS> 

E(S) 

.75 

3.0 

5 

.757 

3.48 

.75 

2.4 

8 

.760 

5.59 

.75 

2.0 

13 

.751 

8.18 

.75 

1.6 

32 

.752 

17.80 

Table  3  (for  PTC>  * 

P<CSSk*3,£-LFC>,  E(St.)  for  various  P*f  6* 


We  see  that  we  must  choose  N„»5  with  the  resulting 
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P<CBIe-LFC>-0.757  and  E (6p  t£«LFO«3.4B. 

PBB 

3.6  ,  fen  augmented  version  of  PpG 

We  now  employ  the  same  device  which  was  used  in 
vie.,  we  stop  sampling  when  the  cell  in  second  place  only 
has  a  chance  to  tie. 


Procedure  t 

1.  Specify  k,  P*,  6*. 

2.  Take  observations  until 

A.  J  <l/e*>  tk3»t  <  ci-P*>/P*  or 

i“l 

®*  t*NAl*^BB*  Where  NBB  4*  fr°m  *BB  °r 
C'  Xtkl,t“XCk-U,t  *  NAl_t*// 

Remarks  3.6: 

1.  Clearly,  E(S_  >  <  ECS_  >. 

A1  ~  PBS 

2.  By  reasoning  similar  to  that  given  in  Bechhofer  and 
Kulkarni  (19B3),  PtCS!Pftl>  «  PtCSSPBG>.  That  is,  no  PtCS> 
is  lost  between  the  two  procedures. 

3.  Tables  for  P ^  are  currently  being  prepared.  See 
Remark  3.6.2  above  for  information  concerning  the  PCCS>. 


Example  3.6.1: 

Again,  let  k«3,  P#«0.75,  ©**3.  Then  Nftl«5  and  PtCSS£«LFC>»0.757 

as  before.  Now,  E<Sp  l£«LFC>«3.24  <  3.48-EtS.  l£«LFC)... 

rAl  PBB  " 


3. 7  General  remark* 


We  have  seen  procedure*  which  follow  a  poset  of  sorts  in  term*  of 
sampling  efficiency.  leads  to  the  more  efficient 

Similarly,  l*sd*  to  which,  in  turn,  leads  to  P^ j.  PRA  stands 

alone.  We  note  that  augmentations  may  be  made  to  but  this  makes 

Rn 

our  search  for  the  optimal  combination  of  r,  N,  etc.,  intractable. 


p  _ >  p 

BEM  BK 

PBKS  >  PBG  >  PM 
PRA 

In  lieu  of  work  currently  in  progress,  the  authors  recommend  use  of 
PBA  or  p  when  these  procedures  are  applicable  to  the  situation  at 
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4.  Applications  and  Augmentations  In  Simulation 

Me  are  now  interested  in  the  more  general  problem  of  determining 
which  of  k  arbitrary  populations  is  the  'best.*  Suppose  X^ 

is  an  independent  observation  from  ,  iBi«...,k.  Recall  that  we  can 
correspond  each  of  the  k  l^'s  with  a  cell  of  a  k-nomial  distribution 
with  cell  probabilities  P}f*<Pk(  where 

Pi  ■  PCXj  is  the  'most  desirable*  of  Xj,...,Xk>.  Hence,  the 
multinomial  procedures  described  in  the  last  section  are 
nonpmrmmetr i c.  This  fact  is  of  tremendous  importance  for  simulators 
since  the  underlying  distributions  of  the  I^’s  (i.e.  ,  k  simulated 
systems)  are  frequently  unknown. 

4. 1  An  example 

Suppose  that  we  wish  to  choose  that  one  of  k  different  (s,S) 
inventory  policies  which  will  have  the  highest  probability  of  yielding 
the  maximum  profit  for  a  small  company.  Here,  profit  is  taken  to  be 
the  criterion  of  desirability.  It  is  assumed  that  the  financial 
affairs  of  the  company  are  complicated  enough  such  that  an  analytic 
solution  of  this  problem  is  not  possible.  Thus,  it  is  necessary  to 
resort  to  the  use  of  simulation  and  multinomial  selection  techniques. 

For  the  sake  of  simplicity,  suppose  that  k«3,  P**0.75,  and  ©**3. 
That  is,  we  must  choose  among  three  <s,S)  policies;  it  is  desired 
that  PtCS!pck3  >  ©*pck_l3>,  where  pi  is  the  probability  that  the  i-th 
policy  yields  the  highest  profit  in  a  given  k-vector  observation.  Me 
will  use  procedure 

Me  simulate  each  of  the  three  (s,S)  policies  (with  different 


pseudo-random  number  sequences)  to  obtain  vector  observations 
— l*-2****  YJ»t  *  th*  profit  from  policy  j  on  the  t-th  simulation 

run.  *  <V1  ,t *Y2,t*Y3,t>  *  where  T  is  the  steQe  of 

sampling  at  which  /»AJ  terminates.  After  the  t-th  stage  of  sampling  1 
completed,  identify  the  policy  which  yields  the  highest  profit  among 
tYl,t'Y2lt,V3,t>'  **  necessary,  use  randomization  to  break  ties. 
Increment  the  count  in  the  corresponding  multinomial  cell  by  one. 


Example:  If  Yf  -  (356,422,297),  then  the  highest 

profit  (for  this  vector  observation)  is  realized  by  I2» 
Thus,  the  count  x%  *  <Ki  ,1 **2,1 **3,1*  *  (0,1,0).// 


Take  3-vector  simulated  observations  until  calls  for  the 

A1 


termination  of  sampling.  Recall  from  Section  3.6  that  P 


A1 


terminates  when 

1.  1  (l/©*)***3’1  <  (1-P*)/P*  (  «  i/3)  or 

i«l  ~ 

2.  t  «  Nftl  (  ■  5)  or 

3*  *tkl,t  ”  *tk-13,t  *  NAl"t  (  *  5_t> 


In  the  table  below,  we  continue  the  example.  The  first  column 
S*ves  the  sampling  stage  —  i.e. ,  the  number  of  3— vector  observations 
which  have  been  taken.  In  the  next  three  columns,  the  3— vectors  of 
simulated  data  are  given.  These  are  followed  by  the  corresponding 


multinomial  cell  x 


Stage  t 

Vl*t 

1 

V2.t 

V3.t 

"l.t 

*2,t 

K3,t 

1 

336 

422 

297 

0 

1 

0 

2 

411 

37B 

314 

1 

1 

0 

3 

374 

393 

380 

1 

2 

0 

4 

368 

374 

378 

1 

2 

1 

At  stage  t  «  4,  P  calls  for  procedure  termination  since 
K[kJ  t  -  t  «  Nftl  -  t.  We  choose  policy  two  as  'best,*  since 

that  is  the  policy  corresponding  to  T*// 

4.2  Pseudo-observations 

We  discuss  an  augmentation  of  that  eliminates  populations 

which  seem  to  be  'inferior. '  The  augmentation  takes  advantage  of  the 
possibility  that  in  the  course  of  sampling,  some  of  the  T^'s  will  have 
no  chance  of  'winning*  (being  chosen  as  'best'). 

For  instance,  in  the  example  of  Section  4.1, 
x3  *  (Xj  3**2  3**3  3*  *  (1,2,0). 

Claim:  Given  that  x^  *  (1,2,0),  it  is  impossible  for 

13  to  win  (in  this  example). 

Proof: 

Case  1:  If  xq  «  (2,2,0),  then  only  Ij  and  I2  can 
win  (since  ■  5). 

Case  2:  If  x,  ■  (1,3,0),  then  sampling  terminates 

and  I_  wins  (since  J  (1/©*)  tk3,t  ti3,t  <  (i-p*>/p* 

2  i-1 

■  (1,2,1),  then  I2  wins  (since 


Case  3:  If  x 
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Thu*,  in  thl *  example,  it  i*  pointing  to  sample  from  13  given 
that  *3  •  (1,2,0). 

With  thi*  ex ample  in  mind,  consider  the  following  augmented 

procedure,  P#,  which  no  longer  takes  observations  from  U3*  Suppose 

that  before  the  next  vector  observation  is  taken,  a  U(0,1>  probability 

die  is  rolled.  Let  the  outcome  of  the  roll  be  0<u<l.  Since  the  PR 

must  be  satisfied,  assume  that  <PC  j 3  »P£23  »Pt33>  *  «P.Pt©*P>,  where 

P  c  1/C©  e2).  That  is,  the  underlying  configuration  of  p^'s  is  the  SC 

(the  conjectured  LFC).  If  u<p<l,  award  a  'success'  to  multinomial 

cell  3  Ci.e.,  increment  cell  3's  count  by  one:  x_  *  x_  l) 

3,3 

without  actually  taking  vector  observation  W+.  In  this  case,  we  have 
generously  awarded  I3  a  'free  success,*  and  we  call  this  non¬ 
observation  a  pseudo-observation .  If  u>p,  define  Y.  *  CY  ,Y_  _).  In 
this  case,  we  only  sample  from  and  I2.  Increment  as  usual  the 
count  of  the  cell  corresponding  to  the  'more  desirable'  of  the  two 
observations.  Take  observations  in  this  manner  until  any  of  the 
stopping  criteria  from  PA1  are  met  (where  the  x^  t's  are  defined  as 
above) . 

With  the  example  still  in  mind,  let  B  be  the  event  that  we  are 
using  procedure  the  underlying  configuration  of  the  p.  's  is  the 

SC,  and  1,2,0).  Define  C  similarly  except  that  is  to  be 

replaced  by  P#. 


Claim:  P<CS*.B>  «  P<CS!C>. 

Proof:  Since  we  operate  in  the  SC,  p.  *  p  or  ©*p. 

Ga*«  Is  If  p3  ■  ©*p,  then  cell  3  is  the  correct  cell 
(*ince  ©*  >  1).  However,  it  is  clear  that 


p<cstp3»e  p,  B)  «  p<csip3«e  p,  o  •  o.  t/ 

Cate  2i  Suppose  p3  *  p.  Then  <ps,p„4>  *  (p,8*p)  or 
(B*p,p).  Assume  the  former  subcase.  A  similar  argument 
Mill  apply  for  the  latter.  Consider  P#  and  a  given 
3-vector  observation.  Then  H3  is  awarded  a 
(pseudo-) success  with  probability  p.  Further,  P ,  is 
awarded  a  success  m.  p. 

P(1I3  will  not  get  the  success)  x  P(lj  will  get  the 
success  I  only  Pj  and  p^  are  under  consideration). 

*  (1— p)  x  p/(p+8*p)  ■  p.  Similarly, 

P(I2  will  get  the  success)  *  ©*p.  But:  these  success 
probabilities  are  exactly  the  same  as  those  from 
Since  the  termination  criteria  for  both  procedures  are 
also  identical,  we  have  the  result.  // 

Goldsman  and  Schruben  (1984)  consider  a  more  general 
version  of  P#. 

Procedure  P^s 

1.  Specify  k,  P*,8*. 

For  tsl,2,... 

2.  Let  I.  *  (ilx...  .  -  x .  >  N_,  -  t) 

t  tk),t  l ,t  —  A1 

(This  is  the  set  of  Ij's  that  no  longer  have  a 
chance  to  win.) 

3.  For  each  icl^,  allocate  an  interval  of  length 
p  of  CO, 13,  where  p*l/ (k-l+e*) . 

4.  Roll  a  U(0,1)  random  number,  u. 

5.  If  u  falls  in  an  interval  allocated  for  some  jtl^, 
increment  the  corresponding  x  .  .  by  one  (i.e. ,  award 


v» 


Otherwise*  take  actual 


a  'pseudo-success'  to  H j). 

observations  from  ell  fl^'s  such  that  i t<l ,. . . ,k)\l^. 

Increment  by  one  the  x.  .  corresponding  to  the  'most 

•  «t 

desirable'  observation. 

6.  Terminate  the  procedure  (with  the  usual  decision 
rule)  if  any  of  the  termination  criteria  for 
are  satisfied. 

Remarks  4.2: 

1.  Goldsman  and  Schruben  (1984)  prove  that 

ptcs:pm,  £=lfc>  -  p{cssf»A2*  e«lfc>. 

2.  Clearly,  E(S  :^=EPC)  <  E(Sp  l£=EPC),  where  S  is 

A2  ”ftl 

the  number  of  stages  (in  which  actual  observations  are 
taken)  until  termination.  It  seems  likely  that  this 


relationship  also  holds  when  f>=LFC,  but  this  has  not  yet 
been  proven. 


3.  Tables  for  are  currently  being  prepared.  See 
Remarks  4.2.1  and  3.6.2  for  information  concerning  PCCS>. 

4.  The  trick  of  taking  pseudo-observations  is  particularly 
suited  for  the  simulation  environment. 


Example  4.2.1: 

Again,  let  k«3,  P*«0.75,  6*«3.  Then  P<CS!e*LFC> 

0.757  as  before,  and  E(S^  ! g*LFC)  «  3.12  ^ 

rA2 


4.3  Correlation  induction 

Frequently,  it  is  possible  for  the  simulator  to  artificially 
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induce  (positive)  correlation  among  the  1^'*.  For  instance,  the 
simple  technique  of  common  random  numbers  can  be  used  (when 
applicable).  More  complicated  methods  can  also  be  implemented.  It 
stands  to  reason  that  as  the  correlation  among  the  populations 
increases,  it  becomes  easier  f or  the  experimenter  to  distinguish  which 
of  the  populations  is  the  ‘best. ' 

Consider  the  aforementioned  selection  procedures.  Obviously,  an 
increase  in  ©*  facilitates  the  distinction  of  the  'best'  multinomial 
cell.  The  following  crude  example  illustrates  how  positive 
correlation  induction  can  result  in  increased  6*. 

Example  4.3.1: 

Suppose  that  k*2  and  that  is  distributed  normally 

2 

with  unknown  mean  and  known,  common  variance  c  , 
iel,2.  If  one  observation  is  larger  than  another,  the 
first  observation  is  taken  to  be  the  more  desirable. 

So,  define  p^  *  P(X^  >  X2> ,  p2  «  1  -  Pj.  Suppose 
that  Uj  >  U2;  so  we  can  let  p^  «  ©p  and  p2  *  p,  where 
©  is  some  number  >  1.  Clearly,  ©  *  (l-p)/p.  Finally, 
define  #>  *  Corr(Xj,X2>  0. 

Now,  px  -  P(Xj>X2)  «  P(XX-X2  >  0) 

«  PfCXj-Xj-dij-Ujll/e  >  -Oij-Ujl/el, 

where  «  *  V(2*^(l-p)> 

■  l-fi-iiij-lijj/e)  ■  #((m1-m2>/«), 

where  #(.)  is  the  N(0,1>  cdf 

■  ©pp,  say,  -  1-p. 


Then  ©^  »  (l-p>/p  »  f (n> / C 1-f <n> > « 

where  n  *  <Mj -u^y/v. 

Hence,  ©p/©0  *  t* <*l> /♦  (n *  >  3x t  Cl-f  (f|*  >  > /  tl-*(n>  >  3 , 

where  »» *  «  »hMt-P>. 

This  quantity  is  obviously  >  1.  Thus,  ©^  >  ©^.  ^ 

Remark  4.3: 

More  details  and  examples  are  given  in  Goldsman  and 
Schruben  (1984). 

4.4  Population  splitting 

We  remarked  earlier  that  the  LFC  represents  a  'worst  case' 
configuration  for  gift  *,  the  preference  zone.  Of  course,  such  a  worst 
case  is  rarely  encountered  in  practice.  If,  in  a  simulation  study, 

then  it  is  likely  that  £  is  not  in  the  slippage  configuration. 
Perhaps  it  is  possible  to  take  advantage  of  this  likelihood.  A 
generalization  of  taking  pseudo-observations  is  proposed  Which  avoids 
taking  observations  from  I^'i  which  seem  to  be  'inferior'  to  other 
populations.  Indeed,  in  the  course  of  sampling,  we  partition  the  I/s 
into  a  'good'  set  B  and  a  'bad'  set  B.  The  populations  in  G  receive 
real  observations  while  those  in  B  receive  only  pseudo-observations. 

If  certain  populations  in  G  do  not  garner  many  successes  during 
sampling,  it  is  possible  to  exchange  them  with  populations  from  B. 
This  possibility  of  exchange  gives  all  of  the  k  populations  a  chance 
to  win  while  discouraging  actual  sampling  from  'bad'  *s. 

The  sampling  procedure  we  consider  below  is  almost  purely 
heuristic.  To  facilitate  the  discussion,  we  consider  directly 
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MAphno  from  a  k-nomial  distribution. 

Procedure  Pgt 

1.  Specify  k,P*,  6*,  (some  initial  number  of  k-nomial 
observations) . 

2.  Take  n^  observations. 

3.  Partition  cells  l,...,k  into  'good'  cells  6  and  'bad' 
cells  B.  Suggestion:  Place  the  cells  with  >  the  median 
number  of  successes  (either  real  or  pseudo)  into  G;  put 
the  others  in  B. 

4.  Temporarily  be  conservative  and  assume  that  the  'best* 
cell  is  in  B.  In  the  LFC,  this  cell  Hill  have  probability 
e*p;  P(B>  ■  (SB*  -  l)p  ♦  ©*p.  With  probability  P<B>, 
award  a  pseudo-success  to  a  random  cell  in  B.  If  a 
pseudo-success  is  not  awarded  to  a  cell  in  B,  take  a  real 
multinomial  observation  from  the  cells  in  6  (i.e.,  take 
observations  only  from  I i 's  corresponding  to  cells  in  6). 

5.  If  any  of  the  termination  criteria  from  a  previously 

discussed  multinomial  procedure  (which  must  be  pre¬ 
specified)  are  met,  stop  sampling  and  choose  as  best  that 
cell  corresponding  to  j.  Otherwise,  go  to  3. 

Remarks  4.4: 

1.  Admittedly  heuristic,  this  procedure  is  intuitively 
appeal i ng . 

2.  Goldsman  and  Schruben  (1984)  give  more  details  and  some 
limited  computer  simulations  relevant  to  this  procedure. 
Caveat:  very  little  work  has  been  done  to  date. 
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5.  Summary 

In  this  expository  paper.  Me  have  introduced  the  reader  to  the 
problem  of  selecting  the  aiultinomial  cell  Mith  the  largest  underlying 
probability.  A  brief  review  of  some  of  the  existing  aiultinomial 
ulection  procedures  Mas  given.  The  superior  procedures  appear  to  be 
^RA  ^*A1*  *"*e  **BD  ,r0uet*  that  these  multinomial  procedures  could 

actually  be  viewed  as  nonpar ametric  procedures;  thus,  they  should  be 
attractive  to  simulators.  Various  augmentations  for  use  in  the 
simulation  environment  were  presented.  This  interesting  problem 
remains  an  active  area  of  research  from  the  points  of  view  of  both 
statistics  and  simulation. 


Acknowledgement:  We  thank  Prof.  Robert  E.  Bechhofer  of  Cornell 
for  his  many  comments  and  suggestions. 
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